Metadados

Fechar
Metadados

%0 Journal Article
%4 sid.inpe.br/mtc-m21c/2018/05.04.18.30
%2 sid.inpe.br/mtc-m21c/2018/05.04.18.30.39
%@doi 10.1007/978-3-319-77028-4_82
%@issn 2194-5357
%T Investigating the recognition of non-articulatory sounds by using statistical tests and support vector machine
%D 2018
%9 conference paper
%A Souza, Francisco Carlos M.,
%A Souza, Alinne C. Corrêa,
%A Nakamura, Gilberto M.,
%A Soares, Marinalva Dias,
%A Mandrá, Patrícia Pupin,
%A Macedo, Alessandra A.,
%@affiliation Universidade de São Paulo (USP)
%@affiliation Universidade de São Paulo (USP)
%@affiliation Universidade de São Paulo (USP)
%@affiliation Instituto Nacional de Pesquisas Espaciais (INPE)
%@affiliation Universidade de São Paulo (USP)
%@affiliation Universidade de São Paulo (USP)
%@electronicmailaddress fcarlos@icmc.usp.br
%@electronicmailaddress alinne@icmc.usp.br
%@electronicmailaddress gmnakamura@usp.br
%@electronicmailaddress mdiasoares@gmail.com
%@electronicmailaddress ppmandra@fmrp.usp.br
%@electronicmailaddress ale.alaniz@usp.br
%B Advances in Intelligent Systems and Computing
%V 738
%P 639-649
%K Delayed speech development · Speech recognition methods · Machine learning · Automatic speech recognition.
%X People with articulation and phonological disorders need training to plan and to execute sounds of speech. Compared to other children, children with Down Syndrome have significantly delayed speech development because they present developmental disabilities, mainly apraxia of speech. In practice, speech therapists plan and perform trainings of articulatory and non-articulatory sounds such as blow production and popping lips in order to assist speech production. Mobile applications can be integrated into the clinical treatment to transcend the boundaries of clinics and schedules and therefore reach more people at any time. The use of artificial intelligence and machine learning techniques can improve this kind of application. The aim of this pilot study is to assess speech recognition methods prioritizing the training of sounds for speech production, particularly the non-articulatory sounds. These methods apply Mel-Frequency Cepstrum Coefficients and Laplace transform to extract features, as well as traditional statistical tests and Support Vector Machine (SVM) to recognize sounds. This study also reports experimental results regarding the effectiveness of the methods on a set of 197 sounds. Overall, SVM provides higher accuracy.
%@language en
%3 souza_investigating.pdf

Fechar